Reinforcement learning for multi-item retrieval in the puzzle-based storage system

نویسندگان

چکیده

• A deep reinforcement learning algorithm is proposed for multi-item retrieval in the PBS system. compact integer programming model built to evaluate solution quality. conversion handle simultaneous movement. decomposition framework designed large-scale instances. The effect of several factors investigated deduce managerial insights. Nowadays, fast delivery services have created need high-density warehouses. puzzle-based storage system a practical way enhance density, however, facing difficulties process. In this work, algorithm, specifically Double&Dueling Deep Q Network, developed solve problem with general settings, where multiple desired items, escorts, and I/O points are placed randomly. Additionally, we propose Extensive numerical experiments demonstrate that approach can yield high-quality solutions outperforms three related state-of-the-art heuristic algorithms. Furthermore, movement instances respectively, thus improving applicability

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reducing Retrieval Time in Automated Storage and Retrieval System with a Gravitational Conveyor Based on Multi-Agent Systems

The main objective of this study is to reduce the retrieval time of a list of products by choosing the best combination of storage and retrieval rules at any time. This is why we start by implementing some storage rules in an Automated Storage/Retrieval System (Automated Storage and Retrieval System: AS/RS) fitted with a gravity conveyor while some of these rules are dedicated to storage and ot...

متن کامل

An Integrated Model for Storage Location Assignment and Storage/Retrieval Scheduling in AS/RS system

An integrated optimization framework, including location assignment under grouping class-based storage policy and schedule of dual shuttle cranes, is offered by presenting a new optimization programming model. The objective functions, which are considered at this level, are the minimization of total costs and energy consumption. Scheduling of dual shuttle cranes among specified locations, which...

متن کامل

Retrieval–travel-time model for free-fall-flow-rack automated storage and retrieval system

Automated storage and retrieval systems (AS/RSs) are material handling systems that are frequently used in manufacturing and distribution centers. The modelling of the retrieval–travel time of an AS/RS (expected product delivery time) is practically important, because it allows us to evaluate and improve the system throughput. The free-fall-flow-rack AS/RS has emerged as a new technology for dr...

متن کامل

Hypervolume-Based Multi-Objective Reinforcement Learning

Indicator-based evolutionary algorithms are amongst the best performing methods for solving multi-objective optimization (MOO) problems. In reinforcement learning (RL), introducing a quality indicator in an algorithm’s decision logic was not attempted before. In this paper, we propose a novel on-line multi-objective reinforcement learning (MORL) algorithm that uses the hypervolume indicator as ...

متن کامل

An Online Q-learning Based Multi-Agent LFC for a Multi-Area Multi-Source Power System Including Distributed Energy Resources

This paper presents an online two-stage Q-learning based multi-agent (MA) controller for load frequency control (LFC) in an interconnected multi-area multi-source power system integrated with distributed energy resources (DERs). The proposed control strategy consists of two stages. The first stage is employed a PID controller which its parameters are designed using sine cosine optimization (SCO...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: European Journal of Operational Research

سال: 2023

ISSN: ['1872-6860', '0377-2217']

DOI: https://doi.org/10.1016/j.ejor.2022.03.042